Intentional Voice Command Detection for Trigger-Free Speech Interface
نویسندگان
چکیده
منابع مشابه
Intentional voice command detection for completely hands-free speech interface in home environments
We introduce a new class of speech processing, called Intentional Voice Command Detection (IVCD). It is necessary to reject not only noises but also unintended voices to achieve completely hands-free speech interface. Conventional VAD framework is not sufficient for such purpose, and we discuss how we should define IVCD and how we can realize it. We investigate implementation of IVCD from the v...
متن کاملSpeech Shift: Speech Input Interface Using Intentional Control of Voice Pitch
あらまし 本論文では,非言語情報の一つである音高を利用した,「音声シフト」という新たな音声入力インタ フェース機能を提案する.従来の音声認識システムが主に言語情報だけを利用してきたのに対し,我々は非言語 情報を積極的に活用することによって,音声のもつ潜在能力を引き出した使いやすいインタフェースを構築する ことを目指している.音声シフトでは,普通に発声した発話と故意に高く発声した発話を異なる入力モードに割 り当てることで,音声のみでモード指定と情報入力とを同時に行うことを可能にする.例えば,音声ディクテー ションにおいて,「改行」と普通に発声するとその文字が入力され(文字入力モード),それを高く発声すると行 末が改行される(コマンドモード)機能が実現できる.こうした機能を実現するために,本研究では,故意に高 い発声を識別する際に必要となる話者ごとの音高の基準を,有声休止区間の音高を用い...
متن کاملCommand Speech Interface to Virtual Reality Applications
During last five years several attempts to develop the speech interface to especially simulation applications emerged due to the recent improvements in speech and language technology and the complexity of those application’s interfaces. We describe our approach to control Virtual Reality applications via voice and GUI, in creation of simple multimodal command speech interface based on dialog mo...
متن کاملVoice activated command and control with speech recognition over WiFi
This paper presents work conducted to date on the development of a voice activated command and control framework specifically for the control of remote devices in a ubiquitous computing environment. The prototype device is a Java controlled Lego Mindstorm robot. The research considers three different scenario configurations. A recognition grammar for command and control of the robot has been cr...
متن کاملSpeech shift: direct speech-input-mode switching through intentional control of voice pitch
This paper describes a speech-input interface function, called speech shift, that enables a user to specify a speech-input mode by simply changing (shifting) voice pitch. While current speech-input interfaces have used only verbal information, we aimed at building a more user-friendly speech interface by making use of nonverbal information, the voice pitch. By intentionally controlling the pitc...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEICE Transactions on Information and Systems
سال: 2010
ISSN: 0916-8532,1745-1361
DOI: 10.1587/transinf.e93.d.2440